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Abstract 



The NP-complete Permutation Pattern Matching problem asks whether a 
permutation P (the pattern) can be matched into a permutation T (the text). A 
matching is an order-preserving embedding of P into T. In the Generalized Per- 
mutation Pattern Matching problem one can additionally enforce that certain 
adjacent elements in the pattern must be mapped to adjacent elements in the text. 
' This paper studies the parameterized complexity of this more general problem. We 

, show W[l]-completeness with respect to the length of the pattern P. Under stan- 

^ ' dard complexity theoretic assumptions this implies that no fixed-parameter tractable 

O . algorithm can be found for any parameter depending solely on P. 

^ ' 1 Introduction 

1.1 Permutation patterns 

0\ 

\ The concept of pattern avoidance (and, closely related, pattern matching) in permutations 
Q\ ' arose in the late 1960ies when Donald Knuth asked which permutations could be sorted 
using one stack in an example of his Fundamental algorithms . The answer is simple: 
These are exactly the permutations avoiding the pattern 231 and they are counted by the 
Catalan numbers. By avoiding a certain pattern the following is meant: 



^ ■ Definition 1.1. Let P be a permutation of length k < n (the pattern). We say that 
. the n-permutation T (the text) contains P as a pattern or that P can be matched into 
T if we can find a subsequence of T that is order-is omorphic to P. If there is no such 
subsequence we say that T avoids the pattern P. Matching P into T thus consists in finding 
a monotone map jj. : [k] ^ [n] so that the sequence Ijl{P), being defined as (//(P(i)))^gj^j, 
is a subsequence of T. 

Example 1.2. The text T = 53142 (being a permutation written in one-line representa- 
tion) contains the pattern 312, since the entries 512 and also 534 form a 312-pattern. T 
however avoids the pattern 123 since it contains no increasing subsequence of length three. 

Since 1985, when Rodica Simion and Frank Schmidt published the first systematic 
study of Restricted Permutations ^8j, the area of pattern avoidance in permutations has 
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become a rapidly growing field of discrete mathematics, more specifically of (enumerative) 
combinatorics. 

There is a far-reaching generalization of pattern matching to so-called generalized pat- 
terns. These were introduced in |1] and have since then received a lot of attention (see [19] 
for a survey). In contrast to "classical" pattern matching, some elements of a generalized 
pattern may be forced to be mapped to adjacent elements in the text. Let us first give an 
example: 

Example 1.3. As in the previous example we consider the text T = 53142. It contains 
the generalized pattern (31)2 as shown by the entries 534. Observe that the subsequence 
(31) of the pattern is mapped to the adjacent elements 53 in the text. This would not 
have been the case for 512 and hence 512 does not form a (31)2-pattern. T avoids the 
pattern (312) since it contains no 312-pattern where all three elements are adjacent. 

Definition 1.4. A generalized pattern P of length k < n is a permutation of length k in 
which parentheses () have been inserted. These parentheses have to be non-nested, non- 
overlapping and well-matched, i.e. every parenthesis ( has to subsequently be followed by 
a ). Also each pair of parentheses has to contain at least two elements. A subsequence of 
P surrounded by a pair of parentheses is called adjacent block. 

We say that the n-permutation T (the text) contains P as a generalized pattern or that 
P can be matched into T if we can find a subsequence of T that is order-isomorphic to 
P and in which adjacent blocks are mapped to adjacent elements in the text. Matching P 
into T thus consists in finding a monotone map fx : [k] ^ [n] so that ^{P) is a subsequence 
ofT and and ^{j) are adjacent whenever i and j lie next to each other in an adjacent 
block. 

We remark that this is not the usual notation which uses dashes instead of parentheses. 
For instance (31)2 is usually written as 31 — 2, (312) as 312 and 312 as 3 — 1 — 2. We prefer 
to use the parentheses notation since it allows to write classical patterns in the "classical 
way" . 

1.2 Computational aspects 

This paper takes the viewpoint of computational complexity. Computational aspects of 
pattern avoidance, in particular the analysis of the Permutation Pattern Matching 
problem, have so far received far less attention than enumerative questions. This problem 
is defined as follows: 

(Generalized) Permutation Pattern Matching (GPPM/PPM) 
Instance: A (generalized) pattern P and a permutation T (the text). 
Question: Is there a matching of P into T? 

PPM is also known as Sub-Permutation problem in the literature. In [5j it was shown 
that the general decision problem is NP-complete. In this paper we focus on the Gen- 
eralized Permutation Pattern Matching (GPPM). To the best of our knowledge, 
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so far no work has been done on GPPM. However it is known to be NP-complete, since 
PPM clearly is a special case. 

While knowing about the NP-completeness of this problem, questions regarding its 
computational complexity remain unanswered. A natural question would be "Is the Gen- 
eralized Permutation Pattern Matching problem still computationally hard when 
the pattern is short?" . A framework able to provide an answer is parameterized complex- 
ity theory. In contrast to classical complexity theory, parameterized complexity theory 
studies the complexity of problems with respect to several parameters and not just the 
input size. Therefore an answer to aforementioned question could be found by performing 
a parameterized complexity analysis using the length k of the pattern as a parameter. 

A first step towards an algorithm for GPPM is a trivial 0{n^) brute-force algorithm. 
However, even for a moderately long pattern this algorithm is unpractical. In order to 
solve GPPM efficiently for short patterns, one could hope for a fixed-parameter tractable 
(fpt) algorithm, i.e. having a runtime of f{k) ■ poly[n) with / being a computable (ideally 
single-exponential) function. 

In this paper we show that under standard complexity theoretic assumptions, no such 
fpt algorithm exists. This is achieved by showing that GPPM is W[l]-complete with 
respect to the length of the pattern. This implies, unless FPT = W [1], that there is no fpt 
algorithm for GPPM for any parameter that depends solely on the pattern. Furthermore, 
it follows that an fpt algorithm may only be found if a parameter depending on the text 
is used. 

2 Related work 

Here we present some relevant work within the complexity analysis of PPM. 

• XP-algorithms, i.e. algorithms with runtime 0{n^^^^), are given in [Il|3]. 

• Special interest has been shown to the case of separable patterns. These are permu- 
tations that can be represented with the help of a separating tree or, equivalently, 
permutations containing neither 3142 nor 2413 as a pattern. Note that every per- 
mutation avoiding any one of the patterns 132, 231, 213 and 312 is separable. Poly- 
nomial time algorithms for the decision and counting problems as well as efficient 
algorithms for the detection of separable permutations can be found in [3l[5tll3t[T6]. 

• Other cases with restricted patterns have also been studied: 

— In case P is the identity, PPM consists of looking for an increasing subsequence 
of length k in the text which is a special case of the Longest Increasing 
Subsequence problem. This problem can be solved in ©(n log n)-time for 
sequences in general [T7] and in C'(nloglogn)-time for permutations [TjllSj. 

— For all patterns of length four PPM can be solved in 0(nlogn)-time [3]. 

— For the case that both the text and the pattern are 321-avoiding an 0{k'^n^)- 
time algorithm is presented in [12]. If only the pattern is 321-avoiding an 
0(/cn^^"'"^^)-time algorithm was found. 
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Two other problems related to PPM are: 

• The Longest Common Pattern problem is to find a longest common pattern 
between two permutations Ti and T2 i.e. a pattern P of maximal length that can 
be matched both into Ti and T2. This problem is a generalization of PPM since 
determining whether the longest common pattern between Ti and T2 is Ti is equiv- 
alent to PPM. The Longest Common Pattern problem for the case that one of 
the two permutations Ti and T2 is separable has been studied e.g. in [6]. 

• For a class of permutations X the Longest X-Subsequence (LXS) problem is 
to identify in a given permutation T its longest subsequence that is isomorphic to 
a permutation of X. In [2] polynomial time algorithms for many classes X are 
described, in general however LXS is NP-hard. 

As mentioned before, we do not know of any work that has been done on GPPM. 

3 Preliminaries 

We give the relevant definitions of parameterized complexity theory. Details can be found 
in 13^5 ■ In contrast to classical complexity theory, a parameterized complexity analysis 
studies the runtime of an algorithm with respect to several parameters and not just the 
input size |/|. Therefore every parameterized problem is a subset of S* x N, where S is 
the input alphabet. An instance of a parameterized problem consequently consists of an 
input string together with a positive integer k, the parameter. 

Definition 3.1. A parameterized problem P is fixed-parameter tractable (or in FPT) if 

there is a computable function f and a polynomial p such that there is an algorithm solving 
P in time f{k) -pd/l). Such an algorithm is called fixed-parameter tractable as well. 

We continue by defining parameterized reductions. 

Definition 3.2. Let Li,L2 C S* x N 6e two parameterized problems. An fpt-reduction 
from Li to L2 is a mapping : S* x N ^ E* x N such that 

1. (/,A;) GLi iffR{I,k)GL2. 

2. R is fixed-parameter tractable. 

3. There is a computable function g such that for R{I, k) = (/', k'), k' < g{k) holds. 

FPT is the parameterized equivalent of PTIME. Other important complexity classes in the 
framework of parameterized complexity are those in the W-hierarchy, W [1] C W [2] C .... 
For our purpose only the class W[l] is relevant. It is conjectured (and widely believed) 
that W [1] 7^ FPT (see e.g. [8j). Therefore showing W[l]-hardness can be considered as 
evidence that the problem is not fixed-parameter tractable. 

Definition 3.3. The class W [1] is defined as the class of all problems that are fpt-reducible 
to the following problem. 
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Turing Machine Acceptance 


Instance: 


A nondeterministic Turing machine with its transition table, 




an input word x and a positive integer k. 


Parameter: 


k 


Question: 


Does the Turing machine accept the input x in at most k steps? 



Definition 3.4. A parameterized problem P is in XP if there is a computable function f 
such that there is an algorithm solving P in time 0(1/1-^^^^). 

Ah the aforementioned classes are closed under fpt-reductions. The following relations 
between these complexity classes are known: 



FPT C W [1] C W [2] C . . . C XP 

Finally, [k] denotes the set {1, . . . , A;}. 

4 The W[l]-completeness result 

Theorem 4.1. GPPM is \N[1]- complete with respect to length of the pattern. 



Proof. We show W[l] -hardness by giving an fpt-reduction from the following problem to 
GPPM: 



Independent Set 


Instance: 


A graph G = {V, E) and a positive integer k. 


Parameter: 


k 


Question: 


Is there a subset S* C y of size k, so that the induced subgraph 




G[S] contains no edges? 



Let {G,k) be an Independent Set instance, where V = {vi,V2, ■■■vi} is the set of 
vertices and E = {ei, 62, ... , Cm} the set of edges. We are going to construct a GPPM 
instance (P, T). We shall first construct a pair {P',T'). P' is a generalized multiset 
pattern, i.e. a generalized pattern in which elements may occur more than once. T' is a 
permutation on a multiset. Applying Definition 11.41 to permutations on multisets means 
that in a matching repeated elements in the pattern have to be mapped to repeated 
elements in the text. Afterwards we deal with the repeated elements in order to create a 
generalized pattern and a permutation on ordinary sets and hereby obtain {P,T). 

Both the pattern and the text consist of a substring coding vertices {P resp. T) and a 
substring coding edges (P resp. T). In between these two substrings we place a separator 
block of constant length c to ensure that P is matched into T and P into T. For the 
moment, we will simply write || for this separator block, indicating that the separator 
block in the pattern has to be mapped to the one in the text. Its construction shall be 
described later on. 

We define the pattern to be 

P' = p\\p = 123... /c||(121)(131) . . . (1A;1)(232) . . . (2A:2) ... ((A: - l)k{k - 1)) 
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Figure 1: An example for the reduction of an Independent Set instance to a PPM 
instance. 

P corresponds to a list of (indices of) k vertices. P represents all possible edges between 
the k vertices (in lexicographic order). An edge from i to j is encoded by (iji). Since 
the complete graph on k vertices has k{k — l)/2 edges the total length of P is equal to 
k + c + 3k{k - l)/2 =_ c + (3fe2 - k)/2. 

For the text T' = T\\T we proceed similarly. T is a list of the (indices of the) / vertices of 
G. T represents all edges not occurring in G listed in lexicographic order. An edge 
is again encoded by iji. Let us give an example: 

Example. Let / = 6 and k = 3. Then the pattern is given by 

P' = 123||(121)(131)(232) 

Consider for instance the graph G with six vertices vi, . . . ,vq and edge-set 

{{1, 3} , {1,4} , {1, 5} , {2, 6} , {3, 4} , {3, 6} , {5, 6}} 

represented in Figure [T] (we write {i,j} instead of {vi,Vj}). Then the eight edges 
not appearing in G are 

{{1, 2} , {1, 6} , {2, 3} , {2, 4} , {2, 5} , {3, 5} , {4, 5} , {4, 6}} . 

The text is thus given by: 

T' = 123456||121 161 232 242 252 353 454 464. 

Since there are at most /(/ — 1) /2 edges not appearing in G, the length of T' is at the most 
c+(3/2-/)/2. 

Claim 1. An independent set of size k can be found in G if and only if there is a simul- 
taneous matching of P into T and of P into T. 
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Example (continuation). In our example {v2,V3,V5} is an independent set of size 
three. Indeed, the pattern P' can be matched into T' as can be seen by matching 
the elements 1,2 and 3 onto 2,3 and 5 respectively. See again Figure [1] where the 
involved vertices respectively elements of the text have been marked in red. 

Proof of Claim [TJ A matching of P into T corresponds to a selection of k vertices amongst 
the I vertices of G. If it is possible to additionally match P into T this means that there 
are no edges between the selected vertices. Observe that a matching of the generalized 
pattern P into T consists of mapping each edge between selected vertices to an edge not 
appearing in G. This is because every adjacent triple of the form iji (with i < j) in T 
corresponds to one of the non-edges of the graph. The selected k vertices thus form an 
independent set in G. Conversely, if for every possible matching of P into T defined by 
a monotone map fi : [k] ^ [I] some (xyx) in P cannot be matched into T, this means 
that {fi{x), fj.{y)} is one of the edges of G since fi{x) fi{y) fi{x) does not appear as adjacent 
elements in T. Thus, for every selection of k vertices there will always be at least one edge 
connecting two of them and therefore there is no independent set of size A; in G. □ 

In order to get rid of repeated elements, we identify every variable with a real interval: 
1 corresponds to the interval [1, 1.9], 2 to [2,2.9] and so on until finally k corresponds to 
[k,k + 0.9] (resp. / to [/,/ + 0.9]). In P and T we shall therefore replace every element 
j by the pair of elements {j + 0.9, j) (in this order). The occurrences of j in P (resp. 
T) shall then successively be replaced by real numbers in the interval [j,j + 0.9]. For 
every j, these values are chosen one after the other (from left to right), always picking a 
real number that is larger than all the previously chosen ones in the interval + 0.9]. 
Note that this construction changes the length of the pattern (resp. the text) by k (resp. 
/) since the length of P (resp. T) is doubled. We now have |P| = c + (3fc^ + k)/2 and 
|r| < c + (3/2 + /)/2. 

Observe the following: The obtained sequence is not a permutation in the classical 
sense since it consists of real numbers. However, by replacing the smallest number by 1, 
the second smallest by 2 and so on, we do obtain an ordinary permutation. This defines 
P and T (except for the seperator block). 

Example (continuation). Getting rid of repetitions in the pattern of the above 
example could for instance be done in the following way: 

P = 1.9 1 2.9 2 3.9 3||(1.1 2.1 1.2)(1.3 3.1 1.4)(2.2 3.2 2.3) 

This permutation of real numbers is order-isomorphic to the following ordinary per- 
mutation: 

P = 61B7FC|K283)(4D5)(9EA) 

For increased legibility, we use letters A, B, C,... with lexicographic order for num- 
bers larger than 9, i.e. A = 10, B = 11, and so on. 

Claim 2. P can be matched into T iff P' can be matched into T' . 
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Proof of Claim\^ Suppose that P' can be matched into T' . When matching P into T, 
we have to make sure that elements in P that were copies of some repeated element in P' 
may still be mapped to elements in T that were copies themselves in T' . Indeed this is 
possible since we have chosen the real numbers replacing repeated elements in increasing 
order. If i in P' was matched to j in T', then the pair [i + 0.9, i) in P may be matched to 
the pair (j + 0.9, j) in T and the increasing sequence of elements in the interval [i, i + 0.9] 
may be matched into the increasing sequence of elements in the interval + 0.9]. 

Now suppose that P can be matched into T. In order to prove that this implies that 
P' can be matched into T', we merely need to show that elements in P that were copies 
of some repeated element in P' have to be mapped to elements in T that were copies 
themselves in T' . Then returning to repeated elements clearly preserves the matching. 
Firstly, it is clear that a pair of consecutive elements i + 0.9 and i in P has to be matched 
to some pair of consecutive elements j + 0.9 and j in T, since j is the only element smaller 
than j + 0.9 and appearing to its right. Thus intervals are matched to intervals. Secondly, 
an element x in P for which it holds that i < x < i + 0.9 must be matched to an element 
y in T for which it holds that j < y < j + 0.9. Thus copies of an element are still matched 
to copies of some other element. 

Finally, replacing real numbers by integers clearly does not change the permutations 
in any relevant way. □ 

We now have to implement the separator block in order to ensure that the two sub- 
strings P and P of the pattern are matched into the corresponding substrings T resp. T 
in T. In P we proceed in the following way. Let us denote its largest element by Pmax 
(this is the next-to-last element of P and was denoted by (k + 0.9) in P'). Then we define 
the separator block to be the generalized pattern (Pmax + 3, Pmax + 2, Pmax + 1, -Pmax + 4). 
This leads to P = P(Pmax + 3, Pmax + 2, Pmax + 1, -Pmax + 4)P. For T we proceed similarly 
and obtain T = f{T,^^^ + 3)(Tmax + 2)(Tmax + l)(rmax + 4)f . We thus have c = 4, i.e. the 
separator block has length 4. 

Example (continuation). The largest element of P is F = 15, thus we will use the 
following three elements as separator block: G, H, I and J. Inserting these elements 
as described above leads to: 

P = 61B7FC(IHGJ)(283)(4D5)(9EA) 

Claim 3. Constructing the separator block in the described way guarantees that in a 
matching of P into T , P is matched into T and P is matched into T. 

Proof of Claim\^ The only (3214)-pattern that can be found in T, i.e. the only decreasing 
subsequence consisting of three adjacent elements followed by a larger element, is formed 
by the elements (Pmax + 3) (Pmax + 2)(rmax + l)(Tmax + 4). Thus the separator block in 
P must be mapped to the one in T. Consequently the elements lying to its left (these 
are exactly the elements of P) must be matched into the elements lying to the left of the 
separator block in T, i.e. into T. For the same reason P must be matched into T. □ 
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This finally yields that {G, k) is a YES-instance of Independet Set if and only if (P, T) is 
a YES-instance of GPPM. It remains to show that this reduction can be done in fpt-time. 
We have already remarked that the length of the pattern is equal to 4 + (SA;^ + k)/2 and 
the length of the text is at the most 4 + (3/^ + l)/2. Thus |P| = 0{k'^) and |r| = 0{l^). 

For showing membership we encode GPPM as a model checking problem of an exis- 
tential first order formula. W[l]-membership is then a consequence of the fact that the 
following problem is W[l]-complete |1U] . 

Existential first-order model checking 

Instance: A structure A and an existential first-order formula 
Parameter: \Lp\ 
Question: \s A a, model for 99? 

Let k = \P\. We compute a structure A = (^,<,T<,5'), where the domain set A = 
{1, . . . , n} represents indices in the text. r< is a binary relation where T'<(x, y) is true iff 
the element on the x-th. position in T is less than the element on position y. 5" is a binary 
relation where S{x,y) is true iff y is the successor of x, i.e. x + 1 = y. T<, S" and < can 
be computed in polynomial time. The formula ip we want to check is 

(f =3xi . . . 3xi^ xi < X2 A X2 < X3 A ... A x/^^i < A 

/\ r<(xi,Xj)A /\ ^r<(xi,Xj)A /\ S{xi,Xi+i), 

P(i)<P(j) P{i)>P{j) AdjToRight(i) 

for ij(i[k] for i,j(i[k] for i£[fe— 1] 

where AdjToRight{i) is true iff P{i)P{i + 1) is contained in an adjacent block. Observe 
that the length of ip is in 0{k'^). The correctness of the reduction follows directly from 
Definition 11.41 Indeed, the fact that P can be matched into T means that indices xi, . . . , 
where xi < . . . < Xfc can be found so that P{i) < P{j) iff T(xj) < T{xj) and so that 
Xi + 1 = Xj+i holds whenever AdjToRight{i) is true. □ 

Remark 4.2. Since in the fpt-reduction the length of the pattern can be bounded by a 
polynomial in the size of G, this is also a polynomial time reduction. Therefore the proof 
of Theorem 14. II can also be seen as an alternative way of showing NP-hardness for GPPM. 
This also follows from the NP-hardness of the less general PPM. 

5 Conclusion 

We have shown that the Generalized Permutation Pattern Matching problem is 
W[l]-complete with respect to the length of the pattern. This implies that under standard 
complexity theoretic assumptions there is no fpt-algorithm for this problem with respect 
to the length of the pattern. Furthermore, this also yields that no parameter that is 
a function of the pattern can yield fixed-parameter tractability. In order to obtain fpt 
results, we plan to study parameters concerning the text. The question whether PPM is 
also W[l]-hard remains open. 
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